41 research outputs found
Pure Spectral Graph Embeddings: Reinterpreting Graph Convolution for Top-N Recommendation
The use of graph convolution in the development of recommender system
algorithms has recently achieved state-of-the-art results in the collaborative
filtering task (CF). While it has been demonstrated that the graph convolution
operation is connected to a filtering operation on the graph spectral domain,
the theoretical rationale for why this leads to higher performance on the
collaborative filtering problem remains unknown. The presented work makes two
contributions. First, we investigate the effect of using graph convolution
throughout the user and item representation learning processes, demonstrating
how the latent features learned are pushed from the filtering operation into
the subspace spanned by the eigenvectors associated with the highest
eigenvalues of the normalised adjacency matrix, and how vectors lying on this
subspace are the optimal solutions for an objective function related to the sum
of the prediction function over the training data. Then, we present an approach
that directly leverages the eigenvectors to emulate the solution obtained
through graph convolution, eliminating the requirement for a time-consuming
gradient descent training procedure while also delivering higher performance on
three real-world datasets
Stratification structure of urban habitats
This paper explores the community structure of a network
of significant locations in cities as observed from location-based social
network data. We present the findings of this analysis at multiple spatial
scales. While there is previously observed distinct spatial structure at
inter-city level, in a form of catchment areas and functional regions,
the exploration of in-city scales provides novel insights. We present the
evidence that particular areas in cities stratify into distinct “habitats”
of frequently visited locations, featuring both spatially overlapping and
disjoint regions. We then quantify this stratification with normalized
mutual information which shows different stratification levels for different
cities. Our findings have important implications for advancing models
of human mobility, studying social exclusion and segregation processes
in cities, and are also of interest for geomarketing analysts developing
fidelity schemes and promotional programmes
FewSOME: One-Class Few Shot Anomaly Detection with Siamese Networks
Recent Anomaly Detection techniques have progressed the field considerably
but at the cost of increasingly complex training pipelines. Such techniques
require large amounts of training data, resulting in computationally expensive
algorithms that are unsuitable for settings where only a small amount of normal
samples are available for training. We propose 'Few Shot anOMaly detection'
(FewSOME), a deep One-Class Anomaly Detection algorithm with the ability to
accurately detect anomalies having trained on 'few' examples of the normal
class and no examples of the anomalous class. We describe FewSOME to be of low
complexity given its low data requirement and short training time. FewSOME is
aided by pretrained weights with an architecture based on Siamese Networks. By
means of an ablation study, we demonstrate how our proposed loss, 'Stop Loss',
improves the robustness of FewSOME. Our experiments demonstrate that FewSOME
performs at state-of-the-art level on benchmark datasets MNIST, CIFAR-10,
F-MNIST and MVTec AD while training on only 30 normal samples, a minute
fraction of the data that existing methods are trained on. Moreover, our
experiments show FewSOME to be robust to contaminated datasets. We also report
F1 score and balanced accuracy in addition to AUC as a benchmark for future
techniques to be compared against. Code available;
https://github.com/niamhbelton/FewSOME
Geometry of Empty Space is the Key to Near-Arrest Dynamics
We study several examples of kinetically constrained lattice models using
dynamically accessible volume as an order parameter. Thereby we identify two
distinct regimes exhibiting dynamical slowing, with a sharp threshold between
them. These regimes are identified both by a new response function in
dynamically available volume, as well as directly in the dynamics. Results for
the selfdiffusion constant in terms of the connected hole density are
presented, and some evidence is given for scaling in the limit of dynamical
arrest.Comment: 11 page
Generating Personalised and Opinionated Review Summaries
Abstract. This paper describes a novel approach for summarising usergenerated reviews for the purpose of explaining recommendations. We demonstrate our approach using TripAdvisor reviews
Item Graph Convolution Collaborative Filtering for Inductive Recommendations
Graph Convolutional Networks (GCN) have been recently employed as core
component in the construction of recommender system algorithms, interpreting
user-item interactions as the edges of a bipartite graph. However, in the
absence of side information, the majority of existing models adopt an approach
of randomly initialising the user embeddings and optimising them throughout the
training process. This strategy makes these algorithms inherently transductive,
curtailing their ability to generate predictions for users that were unseen at
training time. To address this issue, we propose a convolution-based algorithm,
which is inductive from the user perspective, while at the same time, depending
only on implicit user-item interaction data. We propose the construction of an
item-item graph through a weighted projection of the bipartite interaction
network and to employ convolution to inject higher order associations into item
embeddings, while constructing user representations as weighted sums of the
items with which they have interacted. Despite not training individual
embeddings for each user our approach achieves state of-the-art recommendation
performance with respect to transductive baselines on four real-world datasets,
showing at the same time robust inductive performance
Can We Transfer Noise Patterns? An Multi-environment Spectrum Analysis Model Using Generated Cases
Spectrum analysis systems in online water quality testing are designed to
detect types and concentrations of pollutants and enable regulatory agencies to
respond promptly to pollution incidents. However, spectral data-based testing
devices suffer from complex noise patterns when deployed in non-laboratory
environments. To make the analysis model applicable to more environments, we
propose a noise patterns transferring model, which takes the spectrum of
standard water samples in different environments as cases and learns the
differences in their noise patterns, thus enabling noise patterns to transfer
to unknown samples. Unfortunately, the inevitable sample-level baseline noise
makes the model unable to obtain the paired data that only differ in
dataset-level environmental noise. To address the problem, we generate a
sample-to-sample case-base to exclude the interference of sample-level noise on
dataset-level noise learning, enhancing the system's learning performance.
Experiments on spectral data with different background noises demonstrate the
good noise-transferring ability of the proposed method against baseline systems
ranging from wavelet denoising, deep neural networks, and generative models.
From this research, we posit that our method can enhance the performance of DL
models by generating high-quality cases. The source code is made publicly
available online at https://github.com/Magnomic/CNST
Clarification of the Bootstrap Percolation Paradox
We study the onset of the bootstrap percolation transition as a model of
generalized dynamical arrest. We develop a new importance-sampling procedure in
simulation, based on rare events around "holes", that enables us to access
bootstrap lengths beyond those previously studied. By framing a new theory in
terms of paths or processes that lead to emptying of the lattice we are able to
develop systematic corrections to the existing theory, and compare them to
simulations. Thereby, for the first time in the literature, it is possible to
obtain credible comparisons between theory and simulation in the accessible
density range.Comment: 4 pages with 3 figure
Combining Rating and Review Data by Initializing Latent Factor Models with Topic Models for Top-N Recommendation
The 14th ACM Recommender Systems conference (RecSys '20), Virtual Event, 22-26 September 2020Nowadays we commonly have multiple sources of data associated with items. Users may provide numerical ratings, or implicit interactions, but may also provide textual reviews. Although many algorithms have been proposed to jointly learn a model over both interactions and textual data, there is room to improve the many factorization models that are proven to work well on interactions data, but are not designed to exploit textual information. Our focus in this work is to propose a simple, yet easily applicable and effective, method to incorporate review data into such factorization models. In particular, we propose to build the user and item embeddings within the topic space of a topic model learned from the review data. This has several advantages: we observe that initializing the user and item embeddings in topic space leads to faster convergence of the factorization algorithm to a model that out-performs models initialized randomly, or with other state-of-the-art initialization strategies. Moreover, constraining user and item factors to topic space allows for the learning of an interpretable model that users can visualise.Science Foundation IrelandInsight Research Centre2020-10-06 JG: PDF replaced with correct versio